Using Syntax to Disambiguate Explicit Discourse Connectives in Text
نویسندگان
چکیده
Discourse connectives are words or phrases such as once, since, and on the contrary that explicitly signal the presence of a discourse relation. There are two types of ambiguity that need to be resolved during discourse processing. First, a word can be ambiguous between discourse or non-discourse usage. For example, once can be either a temporal discourse connective or a simply a word meaning “formerly”. Secondly, some connectives are ambiguous in terms of the relation they mark. For example since can serve as either a temporal or causal connective. We demonstrate that syntactic features improve performance in both disambiguation tasks. We report state-of-the-art results for identifying discourse vs. non-discourse usage and human-level performance on sense disambiguation. Disciplines Computer Sciences Comments Pitler, E. & Nenkova, A., Using Syntax to Disambiguate Explicit Discourse Connectives in Text, 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP, Aug. 2009, doi: anthology/P09-2004 This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/723
منابع مشابه
Inducing Discourse Connectives from Parallel Texts
Discourse connectives (e.g. however, because) are terms that explicitly express discourse relations in a coherent text. While a list of discourse connectives is useful for both theoretical and empirical research on discourse relations, few languages currently possess such a resource. In this article, we propose a new method that exploits parallel corpora and collocation extraction techniques to...
متن کاملAcquiring a Disambiguation Model For Discourse Connectives
Discourse connectives can show sense ambiguities, in that they can signal more than one possible rhetorical relation. The aim of this study is discover how to disambiguate such discourse connectives using a statistical model. Six discourse connectives (after, as soon as, before, once, since and while) which show ambiguities in the sdrt (Segmented Discourse Representation Theory (Asher & Lascari...
متن کاملExploiting Discourse Relations for Sentiment Analysis
The overall sentiment of a text is critically affected by its discourse structure. By splitting a text into text spans with different discourse relations, we automatically train the weights of different relations in accordance with their importance, and then make use of discourse structure knowledge to improve sentiment classification. In this paper, we utilize explicit connectives to predict d...
متن کاملDiscovery of Ambiguous and Unambiguous Discourse Connectives via Annotation Projection
We present work on tagging German discourse connectives using English training data and a German-English parallel corpus, and report first results towards a more comprehensive approach of doing annotation projection for explicit discourse relations. Our results show that (i) an approach based on a dictionary of connectives currently has advantages over a simpler approach that uses word alignmen...
متن کاملCrosslingual Annotation and Analysis of Implicit Discourse Connectives for Machine Translation
Usage of discourse connectives (DCs) differs across languages, thus addition and omission of connectives are common in translation. We investigate how implicit (omitted) DCs in the source text impacts various machine translation (MT) systems, and whether a discourse parser is needed as a preprocessor to explicitate implicit DCs. Based on the manual annotation and alignment of 7266 pairs of disc...
متن کامل